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^ In this paper I introduce quantile spectral densities that summarize the cychcal 

O behavior of time series across their whole distribution by analyzing periodicities in 

quantile crossings. This approach can capture systematic changes in the impact 
^ of cycles on the distribution of a time series and allows robust spectral estima- 

tion and inference in situations where the dependence structure is not accurately 
captured by the auto-covariance function. I study the statistical properties of 
quantile spectral estimators in a large class of nonlinear time series models and 
I— —I discuss inference both at fixed and across all frequencies. Monte Carlo experiments 

E"' illustrate the advantages of quantile spectral analysis over classical methods when 

^ standard assumptions are violated. 

■3 

a JEL classification: C15, C18, C22, E32 

^ Keywords: spectral density, spectrum, robust estimation, robust inference, quantiles 

^ 1. Introduction 

in 

^ Classical spectral analysis uses estimates of the spectrum or spectral density, a weighted 

^ sum of auto-covariances, to quantify the relative magnitude and frequency of cycles 

^ present in a time series. However, if the dependence structure is not accurately captured 

^ by the auto-covariance function, for example, because the time series under consideration 

^ is uncorrelated or heavy-tailed, then spectral analysis can provide only uninformative 

^ or even misleading results. In this paper I discuss estimation and inference for a new 

•i-j class of spectral densities that summarize the cyclical behavior across the whole dis- 

*^ tribution of a time series by analyzing how frequently a process crosses its marginal 
quantiles. Functions from this class, which I refer to as quantile spectra or quantile 
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spectral densities, are similar to classical spectral densities in both shape and interpre- 
tation, bnt can capture systematic changes in the impact of cycles on the distribution 
of a time series. Such changes arise naturally in a variety of modern time series models, 
including stochastic volatility and random coefficient autoregressive models, and cannot 
be identified through classical spectral analysis, where cycles are assumed to be global 
phenomena with a constant effect on the whole distribution. Quantile spectral analysis 
fundamentally changes this view because it distinguishes between the effects of cycles at 
different points of the distribution of a process and permits a local focus on the parts of 
the distribution that are most affected by the cychcal structure. 

Spectral analysis has traditionally played an important role in the analysis of economic 
time series; see, among many others. Granger (1966), Sargent (1987, chap. 11), Diebold, 
Ohanian, and Berkowitz (1998), and Qu and Tkachenko (2011), where the shape of 
the sample spectral density is typically taken to be one of the "stylized facts" that the 
predictions of a model must match. For macroeconomic data, these stylized facts often 
refer to high-frequency (seasonal) and low-frequency (business cycle) peaks in the spec- 
trum. However, both observed data and the posterior distributions of economic models 
can exhibit heavy tails (Cogley and Sargent, 2002) that can induce peaks at random in 
the sample spectra of the data and the model output, invalidating comparisons between 
the two. For financial data, the stylized facts include the absence of auto-correlation, 
i.e., peakless spectra, and heavy-tailed marginal distributions (Cont, 2001). Stochastic 
volatility models such as GARCH processes (BoUerslev, 1986) can cross almost every 
quantile of their distribution in a periodic manner and at the same time satisfy these 
and other stylized facts, leading the researcher to incorrectly conclude from the spec- 
trum that no periodicity is present. Bispectra and higher-order spectra can possibly 
detect cycles in quantile crossings, but rely on the presumption of light tails since they 
require the existence of at least third moments to be well defined and sixth moments to 
be estimated reliably (see, e.g., Rosenblatt and Van Ness, 1965). Financial time series 
such as log-returns of foreign exchange rates or stock prices may lack finite fourth or 
even third moments (Loretan and Phillips, 1994; Longin, 1996). 

My proposed approach is robust to each of these concerns: Quantile spectral methods 
consistently recover the spectral shape and detect periodicities even in uncorrelated or 
heavy-tailed data. Inference about quantile spectra both at fixed frequencies and across 
frequencies does not require assumptions about the moments of the process. Although 
moment conditions can be used to verify some of the assumptions below, arbitrarily 
small fractional moments suffice. Because several common time series models can in- 
duce situations where cycles are present at some but not at all quantiles, I also provide a 
general Gramer-von Mises specification test for peakless quantile spectra. Under condi- 
tions that are routinely imposed in the literature when testing for the absence of peaks, 
these tests are distribution-free and, depending on the strength of the assumptions, 
sometimes even exact in finite samples. The test remains valid asymptotically under 
much weaker conditions when a bootstrap approximation is used. 

Several recent papers apply quantiles in spectral or correlogram (auto-correlation) 
analysis. Li (2008, 2011) obtains robust spectral estimators via quantile regressions for 
harmonic regression models. Although his estimation method is quite different from 
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that developed here, there is some overlap in our results. I provide a detailed discussion 
in section 3. Katkovnik (1998) relies on the same idea as Li (2008), but only works 
with sinusoidal models and iid noise. Linton and Whang (2007) introduce the "quan- 
tilogram," a correlogram that is essentially the inverse Fourier transform of a quantile 
spectrum, but their focus is on testing for directional predictive ability of financial data 
in the time domain, rather than spectral analysis. Chung and Hong (2007) test for di- 
rectional predictive ability with the generalized spectrum (Hong, 1999) by investigating 
the frequency domain behavior of processes around a given threshold. This approach is 
similar in spirit to quantile spectral analysis but, as Linton and Whang point out, Chung 
and Hong rescale their data with sample standard deviations but do not account for the 
randomness introduced by the rescaling in the derivation of their tests. In contrast, the 
scaling of the data for quantile spectral analysis is provided automatically through the 
marginal quantile function and all of my results are derived under the assumption that 
these quantiles are estimated. 

Other robust spectral methods are discussed by Kleiner and Martin (1979) and Kliip- 
pelburg and Mikosch (1994): Kleiner and Martin focus on time series where the depen- 
dence structure is accurately captured by an autoregressive model of sufficiently high 
order. Quantile spectral analysis differs from these methods in that it is completely non- 
parametric and, most importantly, it robustly estimates cyclical components even when 
an autoregression is not an appropriate model for the data. Kliippelburg and Mikosch 
robustify classical spectral estimates by a self-normalization procedure to estimate nor- 
malized spectra under arbitrarily weak fractional moments conditions. However, their 
results are of limited use for applications because little is known about the asymptotic 
distribution of their procedures. In contrast, I show that quantile spectral estimates 
have relatively simple asymptotic distributions even when no moments exist. 

The remainder of the paper is organized as follows: Section 2 discusses quantile spec- 
tral analysis and introduces two classes of estimators. Section 3 establishes the asymp- 
totic validity of the estimators under weak regularity conditions. In section 4, I show 
the consistency of Cramer-von Mises tests for peakless spectra. The Monte Carlo ex- 
periments in section 5 illustrate the finite sample properties of the estimators and tests. 
Section 6 concludes. The Appendix contains auxiliary results and proofs. 

I use the following notation throughout the paper: 1{-} and 1{.} both represent the 
indicator function and ||X||p denotes (E|X|*')-'^/*'. Limits are as n — )> oo unless otherwise 
noted and convergence in distribution is indicated by The inner product (■, ■)n and 
norm || ■ ||n are defined at the beginning of section 4. 

2. Quantile Spectra and Two Estimators 

This section introduces quantile spectral density estimation as a robust complement 
to classical spectral methods. Spectral analysis aims to reveal periodic behavior in a 
stationary time series Xt with auto-covariance function 7x(i) '^XqXj — (EXq)^ at 
lag j by estimating the spectrum or spectral density at frequency A, defined as 



3 



The auto-covariance function is typically taken to be absolutely summable to ensure 
that fx is continuous and symmetric about 0; a stochastic process that does not possess 
at least finite second moments cannot be analyzed by the spectrum. If fx has a peak at 
A, then Xt is expected to repeat itself on average after 27r/A units of time; for example, a 
monthly time series with a peak in the spectrum at 27r/3 has a three-month cycle, with a 
higher value of fx corresponding to a more pronounced cycle. The primary goal of this 
paper is to develop spectral methods that go a step beyond summarizing the average 
impact of cycles by distinguishing between the effects of cycles at different points of the 
distribution of Xf. 

The central idea is that if a stationary process {Xt)t^z contains cycles, then its real- 
izations will tend to stay above or below a given threshold in an approximately periodic 
manner. The pattern in which the process crosses a threshold at the center of its dis- 
tribution refiects the most prominent cycles, but provides little information about their 
relative sizes. Patterns in threshold crossings near the extremes of the distribution help 
to identify amplitudes of these cycles and they also recover periodicities that are obscured 
at the center of the distribution. The quantiles of Xt, arising from the quantile function 
^o(t) := inf{x : F{Xq < x) > t}, are natural choices for such thresholds because they 
give precise meaning to the notion of the center and extremes of a distribution. Spectral 
analysis of quantile crossing patterns can then discover cycles in the process and reveal 
the extent to which they are present at a given quantile without relying on moments. 

To formalize this idea, pick probabilities r G (0, 1) corresponding to the quantiles 
^o(t). The variable of interest for the analysis is 



such that Vt{r) := Vt{r, ^,o{t)) takes on the value r — 1 if is below its r-th quantile at 
t, and r otherwise. Here the quantiles are not assumed to be known, which enables the 
researcher to choose the values of r according to the amount and nature of information 
that is needed about the cychcal structure of the time series. For example, r = 0.5 only 
analyzes fluctuations about the median, whereas varying r between 0.5 and 0.9 also 
provides information about the positive amplitudes by including values near the upper 
extremes of the process. 

If the distribution function of Xt is continuous and increasing at ^0(7"), then the r-th 
quantile crossing indicator T4(t) is a bounded, stationary, mean-zero random variable 
with auto-covariance function r^(j) := 7y(^)(j) — EVo(t)T^(t). Periodicities in T4(t) are 
summarized by peaks in its spectral density 



which I refer to as the r-th quantile spectrum or r-th quantile spectral density in the 
sequel. Analyzing gr across a grid of probabilities r G (0, 1) therefore reveals cycles in 
events of the form {Xt < Co{t) : ^ G Z}, which in turn summarize {Xt)te2, with arbitrary 
precision as long as the grid is fine enough. 



Vt{T,0 = r-l{Xt<0, 



(r,Oe(0,l)xM, 




(2.2) 
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As the next two Examples show, quantile spectral analysis can in fact yield additional 
insights beyond classical spectral analysis; Linton and Whang (2007) consider similar 
models. I discuss estimation of quantile spectra below. 

Example 2.1 (Stochastic volatility). Let {et)tez be a sequence of iid mean-zero random 
variables and suppose the data are generated by the stochastic volatihty model Xt — 
Co('^o) + ^ti^i^t-i, ■ ■ ■ ) for some Tq G (0, 1), where > is a measurable function 
that drives the volatility of the process. If has finite second moments, then it is an 
uncorrelated time series and its spectrum contains no information about the dependence 
structure beyond that it is "flat," i.e., /x(A) = 7x(0)/(27r) at all frequencies. 

However, any stationary time series with a continuous and increasing distribution 
function at ^o{t) satisfies rT-(O) = t(1 — r) and the stochastic volatility process also has 
the property that 

rr,ij) = EK)(ro)(ro - F{X, < eo(ro) | ...)) = (tq - P(£, < 0))EVo(ro) = 

for all j > 0. Therefore its Tg-th quantile spectrum will also flat in the sense that 
(7^y(A) = ro(l — ro)/(27r), but the other quantile spectra of the stochastic process will be 
informative because r^(j) generally does not vanish for r ^ tq. 

Example 2.2 (QAR). Now let {et)t£Z be a sequence of independent Uniform(0, 1) vari- 
ables and consider the second-order quantile autoregressive (QAR(2)) process of Koenker 
and Xiao (2006), 

Xt = (3o{et) + i3i{et)Xt-i + (32{et)Xt-2 = E{/3,{eo))Xt-i + E(/32(£o))^i-2 + Yt, 

where Yt = /3o{et) + [/3i{et) - E{/3,{eo))]Xt-, + [/32{et) - E(/32(£o))]^t-2. Here /3o, A, 
and P2 are unknown functions that satisfy regularity conditions that ensure stationarity 
and Xt is assumed to be increasing in et conditional on Xt^i, Xt-2- Provided that 
its second moments exist, the sequence (l^)tez has no influence on the shape of the 
spectrum because it is an uncorrelated sequence that is also uncorrelated with the other 
variables on the right-hand side of the preceding display (Knight, 2006). Hence, if 
E/3i(£:o) = E/32(£o) = 0, the spectrum of Xt satisfies /x(A) = 7y(0)/(27r) and classical 
spectral analysis cannot reveal anything about cycles in Xt. If E/3i(£o) and Kf32{so) are 
nonzero, then the spectrum of Xt is the same as that of an AR(2) process with the 
same mean and covariances as the QAR(2). If, instead, there is some tq G (0, 1) such 
that /3i(ro) = /92(ro) = 0, then the ro-th quantile spectrum is also fiat (see Example 4.3 
below), but cycles can be recovered at other quantiles. Further, the quantile spectra of 
the QAR(2) process and those of an AR(2) process with the same mean and covariance 
structure will generally be different. 

For a given sample §„ := {Xt : t = 1, . . . ,n}, I consider two estimators of the quantile 
spectrum that correspond to the periodograms and smoothed periodograms used in 
classical spectral analysis. The key difference from the classical case is that the variable 
of interest \4(r) is indexed by the unknown quantity C,o{'^) and therefore itself has to be 
estimated. To this end, let Cn('?") be the r-th sample quantile determined implicitly by 
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solutions to the minimization problem 



mm 

t=i 



where Pt{x) := x{r — l{x < 0}) is the Koenker and Bassett (1978) check function. Let 
Vti^r) := Vt{T, 'Cn(T)) be the estimate of Vt(r). The T-th quantile periodogram is then the 
"plug-in" estimator 

Qn,rW ■= ^ n-'/' Vt{r)e-^'' ' = ^Y. cos(jA), (2.3) 



27r 

«=1 \j\<n 



where i := and Tn^rij) '■= "^"^ X^tLlil+i bl < As 1 will show 

in the next section, the quantile periodogram inherits the properties of the classical 
periodogram in the sense that it allows the construction of valid confidence intervals, 
but does not provide consistent estimates for the spectrum of interest. 

Consistent estimation of the quantile spectrum requires additional smoothing to assign 
less weight to the imprecisely estimated auto-covariances with lags \j\ near n. For this 
1 apply the Parzen (1957) class of kernel spectral density estimators to the present 
framework. The estimators, which 1 refer to as smoothed T-th quantile periodograms, are 
given by 

^n,r(A) = ^J2 ^U/Bn)rnAj) COs{jX), (2.4) 

\j\<n 

where is a scalar "bandwidth" parameter that grows with n at a rate specified in 
Theorem 3.6 below, and w is a real- valued smoothing weight function from the set 

W := I'M; is bounded and continuous, w{x) — w{—x) Mx e M, 

w{Qi) — 1, w{x) :— supy^^ satisfies w{x) dx < oo, 

W{X) := (27:)-^ J^^w{x)e-'''^ dx satisfies \W{X)\dX < oo}. 

In the literature, w and W are usually called the lag window and spectral window, 
respectively. Both functions are also often referred to as kernels, although w does not 
necessarily integrate to one. 

Remarks. 1. The class W includes most of the kernels that are used in practice, for 
example the Bartlett (i.e., triangular), Parzen, Tukey-Hanning, Daniell, and quadratic- 
spectral windows. However, it excludes the truncated (also known as rectangular or 
Dirichlet) window. See Andrews (1991) and Brockwell and Davis (1991, pp. 359-362) for 
thorough descriptions of these windows and their properties. I provide a brief discussion 
on how to choose w and B„ at the end of the next section. 

2. The restriction w{x) < oo is not standard in the spectral density estimation 



6 



literature. As pointed out by Jansson (2002). it is needed for asymptotic bounds on 
expressions such as B^^^y^^j^\w(j /Bn)\ that typically arise in consistency proofs of 
spectral density estimates indexed by estimated parameters; see also Robinson (1991). 

3. Spectra are non-negative. It is therefore common practice to choose a window such 
that 1^ > to ensure non-negativity of the spectral density estimate; see, e.g., Andrews 
(1991) and Smith (2005). The condition rfA < oo is immediately satisfied for 

such windows in view of the inverse Fourier transform w{x) = e*^^Vr(A) dX, which 
implies W{\) d\ = w{0) = 1 for w G W. The Tukey-Hanning window is an example 
of a window that satisfies |1^(A)| d\ < oo, but not 1^ > 0. 

The next section characterizes the asymptotic properties of the quantile and smoothed 
quantile periodograms. 

3. Asymptotic Properties of Quantile and Smoothed Quantile Periodograms 

In this section 1 construct confidence intervals for the quantile spectrum and establish 
the consistency of the smoothed quantile periodogram under regularity conditions. I 
also compare the quantile periodogram to the periodograms of Li (2008, 2011). 

Throughout the remainder of the paper I assume that {Xt)^^^ is a nonlinear process 
of the form 

Xt^Y{et,et-i,et-2,...), (3.1) 

where {st)ez is a sequence of iid copies of a random variable e and y is a measurable, 
possibly unknown function that transforms the input '■= {^t, £t-i, • • • ) into the output 
Xf. The class (3.1) includes a large number of commonly- used stationary time series 
models. For instance, the processes in Examples 2.1 and 2.2 are of this form; I provide 
other examples below Proposition 3.1 in this section. 

The central conditions for the estimation of spectra are restrictions on the memory 
of the time series. As pointed out by Wu (2005), for time series of the form (3.1) 
such restrictions are most easily implemented by comparing Xt to a slightly perturbed 
version of itself. Let {eDtez be an iid copy of (£t)tez, so that the difference between X^ 
and Xj. := Y{£t, . . . , £i, £q, £*_i, . . . ) are the inputs before time t — 1. Define Xr{S) ■— 
e R : \^o{t) — ^\ < 5} and assume the fohowing: 

Assumption A. For a given r e (0, 1), there exist 5 > and a e (0, 1) such that 

sup ||l{X„<0-lK<OII=C'(a"). 

Intuitively, this condition requires the probability that Xn is below but X!^ is above a 
given threshold (or vice versa) to be sufficiently small for large n as long as the threshold 

is near ^o{t). It is the only dependence condition needed to construct asymptotically 
valid confidence intervals for the quantile spectrum. Assumption A avoids restrictions on 
the summability of the cumulants (Brillinger, 1975, pp. 19-21) of Xt that are routinely 
imposed in the spectral estimation literature; see Andrews (1991) and the references 
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therein. Cumulant conditions or "mixing" assumptions (Rosenblatt, 1984) that imply 
such conditions are sometimes difficult to establish for a given time series model and can 
easily fail or put unwanted restrictions on the parameter space when Xt is, for example, 
generated by a standard GARCH process (BoUerslev, 1986). 

Assumption A does not require the existence of any moments of Xj, but can be verified 
for most commonly used stationary time series models at the expense of an arbitrarily 
weak moment restriction via the geometric moment contracting (GMC) property intro- 
duced by Hsing and Wu (2004). A time series of the form (3.1) is said to be GMC for 
some q; > if — X'^\\a = 0{q^) for some q e (0, 1), where q may depend on a. 

Proposition 3.1. Assumption A is satisfied if Fx{x) := f{Xo < x) is Lipschitz contin- 
uous in a neighborhood of ^q{t) and \\Xn — X!^\\a — 0{q^) for some a > and g e (0, 1). 

The GMC property is satisfied for stationary (causal) ARMA, ARCH (Engle, 1982), 
GARCH, ARMA-ARCH, ARMA-GARCH, asymmetric GARCH (Ding, Granger, and 
Engle, 1993; Ling and McAleer, 2002), generalized random coefficient autoregressive 
(Bougerol and Picard, 1992), and QAR models; see Shao and Wu (2007) and Shao 
(2011b) for proofs and more examples. By Proposition 3.1, all of these models are 
included in the analysis if Fx is Lipschitz near ^o{t) — a condition that is also needed 
for all of my results. 

In addition to Lipschitz continuity, a restriction on Fx is required to ensure both that 
\4(r) can be estimated consistently and that \^{^n{T) —^o{t)) is bounded in probability: 

Assumption B. Fx is Lipschitz continuous in a neighborhood of^o{T) and has a positive 
and continuous (Lebesgue) density at Co(t)- 

This assumption, or slight variations thereof, is standard in the quantile estimation and 
regression literature; see, e.g., Koenker (2005, p. 120) and Wu (2007). 

As a preliminary step towards inference about quantile spectra, the following result 
establishes the joint asymptotic distribution of the quantile periodogram on a subset of 
the natural frequencies . . . , —An/n, — 27r/n, 0, 27r/n, 47r/n, . . . C (— tt, tt]. More precisely. 
Theorem 3.2 shows that the usual convergence of the periodogram at different frequencies 
to independent exponential variables is not affected by the presence of the estimated 
quantile Cn{T)- 

Theorem 3.2. Suppose Assumptions A and B hold for some r G (0, 1). Let An = 2TTjn/n 
with jn E 1j be a sequence of natural frequencies such that A„ — )■ A G (0, tt) with 5't(A) > 0. 
Then, for any fixed k G Z, the collection of quantile periodograms 

Qn,T (An - SttA;/ n) , Qn.T ( An - 27r ( A; - 1 ) /n) , . . . , Qn,T {K + SttA;/ n) 

converges jointly in distribution to independent exponential variables with mean gri^)- 

Remarks. 1. The natural frequencies induce invariance to centering in the quantity inside 
the modulus in (2.3) so we can write 

n n 

n~'^'Y.^t(T)e-'''- = J](l{X, < |„(r)} - Fx(|n(r)))e-*^". 

t=i t=i 
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Given the invar iancc, the strategy for the proof is to show that the empirical process 
on the right-hand side of the preceding display is stochastically equicontinuous with 
respect to an appropriate semi-metric on bounded sets near ^0(7") • For this I extend 
Andrews and Pollard's (1994) functional limit theorems to time series of the form (3.1) 
that satisfy Assumption A. The equicontinuity property and a result of Shao and Wu 
(2007) on classical periodograms at natural frequencies then yield the desired results. 

2. If a quantile of interest ^o{t) is assumed to be known, for example ^o(0.5) = as in 
Li (2008), then Theorem 3.2 remains valid when (i) ^q{t) is used in Qn,T instead of ^„(t), 
(ii) Assumption B is replaced by the condition that Fx is continuous and increasing at 
^o(t), and (iii) Assumption A is replaced by Assumption C below with 5 — 0. This is a 
direct consequence of Shao and Wu's (2007) Corollary 2.1. 

Theorem 3.2 yields a convenient way to construct point-wise confidence intervals for 
the quantile spectrum. The proof follows immediately from the properties of independent 
exponential variables. Example 3.4 provides an application. 

Corollary 3.3. Suppose the conditions of Theorem 3.2 are satisfied. Define Qn,T{\ k) ~ 
T^\i\<kQn,T{^n + 27rj/n)/(2/c -|- 1), and let X4k+2,a ^ quantile of a distribution 

with 4A; -|- 2 degrees of freedom. Then, for every fixed k & the probability of the event 

... f{^k + 2)QnA\k) {4k + 2)QnA>^,k) 

9t[^) I 2 ' 2 

V ^4fc+2,l-a/2 ^4A;+2,a/2 

converges to 1 — a. 

Example 3.4 (Testing for periodicities). The processes in Examples 2.1 and 2.2 are 
instances where Vt(ro) is a white noise series for some tq G (0, 1). Then the ro-th quantile 
spectrum of Xt is to(1 — tq)/ (27r) at all frequencies and therefore contains no periodicities 
at that quantile. Because a spike in the periodogram could either be evidence for a 
periodicity or an artifact generated by the sample, this leads to the problem of testing 
whether the To-th quantile spectrum behaves like a flat quantile spectrum at a given 
frequency. By Corollary 3.3, this hypothesis can be rejected at level a if the confidence 
interval in the Corollary does not contain tq{1 — ro)/(27r). The same type of test is not 
as simple in classical spectral analysis because (2.1) reduces to the unknown quantity 
7x(0)/(27r) if Xf is white noise. I extend the idea of testing for fiatness in section 4 
to provide a test for the more general hypothesis that groW — ^oi^ ~ '^o)/(27r) jointly 
across all frequencies. 

The results stated in Theorem 3.2 and its Corollary overlap to some extent with 
Theorem 2 of Li (2008). He uses the least absolute deviations estimator in the harmonic 
regression model 

n 

^„(A) = argmin V'po.5(^t - cos{tX)bi - sm{tX)b2), 

(6i,62)"reiR2 

to define the Laplace periodogram Ln{X) — n|/3n(A)p/4. In the special case that the time 
series of interest satisfies Xt — cos{tXo)/3i + sm{tXo)/32 + St, where Aq, and ^2 are 
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unknown constants, this approach has the advantage that the maximizer of Ln{X) can 
be used as a robust estimator of \q, although Li provides only Monte Carlo evidence of 
this assertion. For general time series, he assumes that Xt has median zero and a density 
with -Fjc(O) > 0, and that certain short-range dependence conditions are satisfied. 
The proofs of his Theorems 1 and 2 then yield an asymptotically hnear representation 
for Pni^n) that can be used to show 

LniXn) = Fj,(0)-2|n-V2 J2 \4(0.5)e-*^" I' + 0^(1). 

t=i 

The first term on the right-hand side is 27r/F^(0)^ times the quantile periodogram evalu- 
ated at the median. Hence, if the median of Xf is indeed zero, the Laplace periodogram 
and the quantile periodogram at the median are asymptotically equivalent up to the 
unknown constant 27r/F^(0)^. Li (2011) extends his idea of harmonic median regression 
to quantile regression. 

Using Li's (2008, 2011) periodograms instead of the quantile spectral methods intro- 
duced in my paper has the following disadvantages: (i) All of Li's asymptotic results 
depend on terms of the form r(l — t)/Fx{^{t)Y that in his case must be estimated 
to make inference about the dimensionless quantity 5't(A) even for simple tests such as 
in Example 3.4; my approach avoids this complication altogether, (ii) Li's methods re- 
quire quantile regression at every frequency, whereas the quantile periodogram (2.3) can 
be computed easily with the Fast Fourier Transform, (iii) Li does not provide consis- 
tent estimators. For example, L„(A) converges to a distribution with asymptotic mean 
[27r/(4F^(0)^)] X 5'o.5(A), but Li is unable to prove that a smoothed version of L„(A) 
converges in probability to this quantity. In contrast — as I will show now — the quan- 
tile periodogram can be smoothed by standard methods to provide uniformly consistent 
estimates of g^-. 

Consistent estimation of the quantile spectrum requires weaker conditions than the 
construction of confidence intervals because much of the randomness introduced by re- 
placing Tr (as defined above (2.2)) with fn,T is now controlled by the smoothing weight 
function w. Let £q be an iid copy of Eq such that Xt and X^* := Y{et, ■ ■ ■ , £i, Sq, £_i, . . . ) 
differ only through the input at time t = 0. I assume Xt satisfies the following: 

Assumption C. For a given r e (0,1) and Xr{5) as in Assumption A, there exists a 
5 > such that 

oo 

sup \\i{Xt<i}-i{x;<i}\\<^. 

t=o «ex,(5) 

Remarks. 1. Assumption A implies Assumption C in view of the relation ||1{^^ < C} ~ 
1{X* < = ||1{X^_^^ < C} - l{^n+i < Oil; see the discussion below equation [13] 
of Wu (2005). For ^ near ^o('''), adding and subtracting 1{X'^ < ^^'^ the triangle 
inequality then yield ||l{v^ri < ~ l{^n < Oil — 0{a'"), which remains valid after 
taking suprema over Xr{S). 

2. A stationary stochastic process is usually called short-range dependent if its auto- 
covariance function is summable. Since Xt can have heavy tails, this definition no longer 
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has the desired meaning. However, Remark 2.1 of Shao (2011a) can be used to show 
that Vt(r) is short-range dependent because 

provided Assumptions A or C hold. This suggests that these assumptions should still 
be regarded as short-range dependence conditions on Xt. Heyde (2002) argues similarly 
to quantify the dependence of the increments of certain Gaussian processes. 

Assumption C is easily verified in most cases via Proposition 3.1. However, more 
direct arguments can also be useful: 

Example 3.5 (Linear processes with Cauchy innovations). Consider the linear process 

Xt = Yl'jLo'^j^t-j, where (a.j)jeN is a sequence of constants and {et)t£Z is an sequence 
of iid copies of a standard Cauchy random variable. Without loss of generality, let 
ao = 1. Write for the distribution function of e; then Xt has distribution function 
Fx{x) = KF^{x — YlJLi'^j^t-j) and therefore also possesses a bounded density F'^ by 
the Lebesgue Dominated Convergence Theorem. Furthermore, apply the point- wise 
inequality |1{X„ < - H^n < 01 < H\^n - CI < l^n - X*\}, then the Mean Value 
Theorem and P(|£o| + kol > x) < P(|£o| > x/2) + P(|£^| > x/2) for any fixed x to see 
that 

iii{x„ < e} - i{x: < < n\Xn - ei < KWso - ei\) 

<P(|Xo-e| < |a„p/2)+P(|a„||£o-£ol > 
< 2|a„|^/'supFj,(x) + 2P(|£o| > |4a„|-^/'), 

which is 0(|a„|^/^) because the tail probability P(|£o| > a:;) of a Cauchy random variable 
is proportional to as x — > 00. Because these bounds hold uniformly in take square 
roots in the preceding display to conclude that Assumption C is satisfied if Xljlo I'^il^^^ < 
00. The same type of reasoning can be used more generally when the innovations come 
from a smooth distribution whose tails behave algebraically. Proposition 3.1 does not 
apply because (a„)„gN does not necessarily vanish at a geometric rate. 

Theorem 3.6 below establishes uniform consistency of the smoothed quantile peri- 

odogram under the condition that the bandwidth Bn grows at a sufficiently slow rate. 
In particular, due to the uniformity Theorem 3.6 allows for both fixed frequencies and 
sequences of frequencies such as the natural frequencies above. 

Theorem 3.6. // Assumptions B and C hold for some r G (0, 1), w G W, Bn 00, 
and Bn — o{^/n), then 

sup |^n,r(A) - gr{>) \ ^ 0. 

AG(-7r,7r] 

Remarks. 1. The proof of Theorem 3.6 relies in part on recent results for classical spectral 
density estimates obtained by Liu and Wu (2010). 

2. At fixed frequencies, kernel spectral density estimates of differentiable functions 
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arc often valid for bandwidths up to order S„ = o{n); sec, e.g., Andrews (1991) and 
Davidson and de Jong (2000). The stronger requirement Bn = o{^/n) reflects that Vt{r) 
is not a smooth function of ^^(t). However, this requirement is not much of a restriction 
because, as Andrews notes, optimal bandwidths are typically of order less than y/n. 

3. If the quantile of interest ^o(''') is assumed to be known, then Theorem 1 of Liu 
and Wu (2010) implies that Theorem 3.6 continues to hold when (i) ^o(''") is used in the 
definition of gn,T instead of ^ra(r), (ii) Assumption B is replaced the condition that Fx 
is continuous and increasing at io{T), (hi) 5 = in Assumption C, and (iv) Bn = o{n). 

4. The smoothed quantile periodogram at a known quantile io{T) is just an ordinary 
smoothed periodogram of 14 (r) and therefore optimality results from classical spectral 
analysis apply. In particular, the optimal lag window among the kernels in Wfljiy > 0} 
with respect to the relative mean-square error (MSE) criterion of Priestley (1962) is the 
quadratic-spectral (QS) window 

25 /sin(67rx/5) \ 
"^^^")=12;^(,-6W^ cos(6W5)J. 

The mean-square optimal bandwidth for the QS kernel is Bn = 0(n^/^), which can be 
established under additional dependence conditions; for example. Assumption A with 
5 — suffices. In the general case where ^o(''") is estimated, a truncated MSE criterion 
as in Andrews (1991) could be used to limit the infiuence of ^n(T)- However, his results 
rely crucially on second-order differentiability of the smoothed periodogram with respect 
to the estimated parameter. A fundamentally different approach is therefore hkely to 
be needed, which I leave for future research. 

I investigate the finite sample properties of the smoothed quantile periodogram and 

confidence intervals based on the quantile periodogram in a small simulation stTidy in 
section 5. The next section discusses the use of integrated quantile periodograms to test 
for uninformative quantile spectra. 

4. Testing for Flatness of a Quantile Spectrum 

In this section I provide two Cramer-von Mises tests (Procedures 4.4 and 4.6 below) for 
the null hypothesis that the r-th quantile spectrum is fiat, i.e., Qri^) = t(1 — r)/(27r), 
against the alternative that the r-th quantile spectrum is informative. 

If the distribution function of Xt is continuous and increasing at Co{t), then rr{0) — 
t(1 — t) and the null and alternative hypotheses can be stated more precisely as 

Ho '■ frij) = for all j > and Hi : ri-(j) 7^ for some j > 0. 

Provided that X]jez'"T(i) converges absolutely, the r-th quantile spectrum is symmetric 
about zero. One way to test for the null hypothesis is therefore to check if the sample 
equivalent of 

J\r{u) f/-^d^ = J2^r{j)M>^), (4.1) 
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where ipj{X) := sin(jA)/(7rA), is near zero for all A G IT := [O.tt]. 

The quantity in the preceding display is best understood as an function in ^2(11), 
the set of Lebesgue-measurable functions / : 11 — )■ M with J^f{XydX < oo. Under the 
equivalence relation "/ = if and only f = g Lebesgue-almost everywhere," 1^2(11) is a 
proper Hilbert space with inner product {f,g)u '■= /n f{X)g{X) dX for f,g& -^2(n) and 
norm ||/||n := ^/ifJh- Since = 1/(27: f) for all j G Z\{0}, (4.1) indeed lies in 

L2(n) and satisfies 



i>o 



j>0 



Here we need the fact that {ipj, ipk)u = for all j ^ k. Now replace rr{j) by rn,T{j) and 
rescale to obtain the Cramer-von Mises statistic 



CMr 



n—l 



n 



n-l 

n \ - 



based on the random process S'„^^(A) := \/nYTjZi fn,T{.j)i' j{X) in -^2(11). No smoothing 
weight function and bandwidth is needed because the integral in (4.1) already acts as 
a smoothing operator. The scaling factor ^/n in Sn,T is included because \/nfn,T{3) can 
be expected to have an asymptotically normal distribution for each j > under the null 
hypothesis. When viewed as a random function on iv2(n), the process Sn^j- should then 
converge in distribution to a mean-zero Gaussian process St{X) with covariances 

E5.(A)5.(A') = 5]5]5^Cov(Vo(r)V,(r), V,_,(r)y,_,_,(r))^,(A)^,(A'), (4.2) 

j>0 fc>0 i& 

A, A' G n, so that CMn,T ll'S'rIln by the Continuous Mapping Theorem (see, e.g.. 
Theorem 18.11 of van der Vaart, 1998, p. 259). 

As the following Theorem shows, this convergence indeed occurs if the conditions 
of the null hypothesis are strengthened slightly: Suppose that under Hq, for a given 
T G (0, 1) there is a 5 > such that 

P(Xo < Xj < = P(Xo < OP(^o < O for all j > and all ^' G X,(5), (4.3) 

where Xr{S) = G M : l^oi^) < ^} as before. The role of this condition is discussed 
in detail in the Remarks and Examples below. 

Theorem 4.1. Suppose Assumptions A and B hold for some r G (0, 1). 

(i) /fHo is satisfied in the sense of (4.3), then CM^^t ^ ll'S'TlIn? 

(ii) z/Hi is satisfied, then P(CM„,^ > B) ^ I for every S G R. 

Remarks. 1. For the proof of the Theorem, I show stochastic equicontinuity of the em- 
pirical process (^ — j)~^^^ X]j=i [^(''"j ~ ^EK)(t, 0] indexed by ^ under 
Assumptions A and B for each fixed j. Condition (4.3) is used to control the behavior 
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of Tn.rii) for large j and n. These two results then allow me to apply a general result 
on Cramer- von Mises tests for spectral densities given in Shao (2011a). 

2. Condition (4.3) imposes slightly more on the dependence structure of Vtir) than the 
white noise assumption Hq (i.e., 5 = 0). However, since 5 can be chosen to be as small as 
desired, it is much less restrictive than requiring that (^t)tez be pairwise independent 
(5 = oo) or even iid, which is frequently imposed when testing for white noise; see, e.g., 
Milh0j (1981) and Hong (1996). 

Example 4.2 (Stochastic volatility, continued). Recall that is the distribution func- 
tion of e. The stochastic volatility process in Example 2.1 has a flat ro-th quantile 
spectrum but fails to satisfy (4.3) because 

P(^o < e,^. < r) = El|., < ^^M^Uf ^;-^o(ro) \ 

can generally not be simplified further due to the lagged innovations contained in the 
volatility process v. Thus, Theorem 4.1 does not apply. However, the test procedure 
from Example 3.4 can still be used in this case to test for flatness of the rg-th quantile 
spectrum, for if 5'to(Ao) = to(1 — ro)/(27r) is rejected at some frequency Aq, then Hi 
must be true. Linton and Whang (2007) investigate the stochastic volatihty model of 
Example 2.1 with the sample quantilogram, defined as r„,r(j)/^n,T(0), for a fixed, finite 
number of lags j = 1,2,.... From their results it can be seen that the failure of (4.3) 
for the stochastic volatility model manifests itself in terms of a non-vanishing drift term 
in y/nfn,T{j) due to the estimation of Co(t)- A Cramer- von Mises test requires control 
of these drifts for large j and n; this is nontrivial and left for future work. 

Example 4.3 (QAR, continued). The QAR process in Example 2.2 possesses a flat 
To-th quantile spectrum and has the property (4.3) if there exists a neighborhood T of 
To such that /3i(t) = (32{t) = for all r G T: In this case, the conditional quantile 
function, defined as the solution ^(r | 3^t-i) of P(Xf < ^ \ 3^t-i) = t, is given by 
e(r I Ji_i) = /3o{t) + l3i{T)Xt-i + l32{T)Xt-2 = l3o{r) almost surely for all r E 7 by 
monotonicity. Take expectations to deduce that 

T = P(X, < e(T I I = < (3o{t)) = F{Xo < Coir)), 

almost surely for all r e T and therefore ^(r | 3^t) — Coir) almost surely on r e T. 
Conclude that for any r, r' e T, 

P(Xo < Ur),Xj < Ur')) = El{Xo < eo(r)}P(X, < eo(r') | 3^,-i) 

= P(Xo<eo(r))P(Xo<eo(r)). 

Now (4.3) follows because as long as Fx is continuous and increasing in a neighborhood 
of ^o(to)) there is a 5 > such that for every C,, C,' G Xt-,)(5), there are t,t' E 7 such that 
C — ioir) and C' = ioir'). The assertion in Example 2.2 about the flatness of the ro-th 
quantile spectrum is obtained by letting T = {tq}. 

The main difficulty with applying Theorem 4.1 in practice is the unknown covariance 
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function (4.2) of the limiting process Sr- In standard spectral analysis, this has led 
researchers to assume that Xt is iid normal under the null hypotheses of white noise 
(Durbin, 1967, is an important early reference) in order to avoid having to estimate 
the covariance function of a Gaussian process. In sharp contrast, in quantile spectral 
analysis the assumption that Xt is iid is already enough to construct a test for flatness 
without imposing a distributional assumption: In large samples Vt(r) is close to Vf(r) — 
T — l{Xt < Co(t)} in probability, but l{Xt < is a Bernoulli random variable with 

success probability r as long as Fx is continuous and increasing at ioir). Hence, if Xt 
is indeed iid and Ji, J2, . . . , J„ are independent Bernoulli(T) variables, then 

^ n— 1 / \ 2 

^^".^ ■■=^Y.r\ E yti^)yt-Ar)) and 

j=l \=l+j ^ 

have the same distribution. Because CM^^r = CMn^r + Op(l) under the conditions of 
Theorem 4.1(i), this distributional equivalence leads to a simple, distribution-free Monte 
Carlo test. I prove its consistency in Corollary 4.5 below. 

Procedure 4.4 (Monte Carlo test for flatness). 1. Draw n iid copies Ji, J2, ■ ■ ■ , Jn of 
a Bernoulli (r) random variable. 

2. Compute CM'^ .^ with the variables from step 1. 

3. Repeat steps 1 and 2 R times. Reject Hq in favor of Hi if CMn,T is larger than 
Cn,r(l ~ «)j the 1 — a empirical quantile of the R realizations of CM'^^. 

Remark. Exploiting the distribution-free character of sign or quantile crossing indicators 
has a long history in statistics and econometrics; see, e.g., Walsh (1960). More recently, 
Chernozhukov, Hansen, and Jansson (2009) use it to construct flnite sample confldence 
intervals for quantile regression estimators. 

By choosing the number of Monte Carlo repetitions R large enough, the quantiles of 
the null distribution of CMn,T can be approximated with arbitrary precision. I therefore 
let i? — 7> 00 and define the quantiles of the simulated distribution directly as c„^^(l — «) : = 
inf{a; e M : P(CM„_^ > a;) < a}. The large sample properties of Procedure 4.4 can now 
be stated as follows: 

Corollciry 4.5. Suppose Assumption B holds for some r e (0, 1) and let a e (0, 1). 

(i) If {Xt)tez is an iid sequence, then P(CM„^^ > c„,t-(1 — a)) — >■ a, and 

(ii) if instead Assumption A and Hi are satisfied, then F{CMn,T > Cn,T(l — ct)) — >■ 1- 
Remark. If ^o(t) is known, then the test in Procedure 4.4 has level a even in flnite 
samples provided that CMn,T is used in step 3 instead of CMn.r- 

In cases where it does not seem reasonable to assume that Xt is iid under the null 
hypothesis, the block-wise wild bootstrap of Shao (2011a) should be used instead. This 
bootstrap is a modiflcation of the standard wild bootstrap (Liu, 1988; Mammen, 1992). 
It perturbs whole blocks of observations with iid copies of a random variable 77 that is 
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independent of the data and satisfies Krj = 0, Ery^ = 1, and Kr]"^ < oo. Since the blocks 
grow with the sample size, this eventually captures enough of the dependence structure 
to provide critical values for the null distribution under the more general condition (4.3). 

Procedure 4.6 (Shao's block- wise wild bootstrap). 1. Choose a block length hn<n 
and the corresponding number of blocks = n/bn, taken to be an integer for 
convenience. For each s — 1, . . . define a block "Bg — {{s — l)&n + 1, ■ ■ ■ , sbn}- 

2. Draw L,„ iid copies rji,rj2, ■ ■ ■ ,1]^^ of r/. For each t = l,...,n, define cut — 
Xlf^i Vs^{t ^ ^s} so that ujt takes on the value rjg if t lies in the s-th block. 

3. Compute fn^rU) •= X^tLj+i[^t('^)^-j(''") ~ 'f'n,T{j)]^t and calculate the boot- 
strap statistic 

4. Repeat steps 2 and 3 R times. Reject Hq in favor of Hi if CMn,T is larger than 
c* ,.(1 — a), the 1 — a empirical quantile of the R realizations of CM*^^^. 

Remark. The recommended choice for 77 in practice is a Rademacher variable that takes 
on the value 1 with probability 1/2 and the value —1 with probability 1/2. Distributions 
other than the Rademacher distribution can be used for 77, in particular if V((r)V(_j(r) 
has a skewed distribution, but there is no evidence that they would lead to better 
inference; see Davidson, Monticini, and Peel (2007) for a discussion of this point for the 
standard wild bootstrap. 

As before, I take R to be large and define the quantiles of the bootstrap distribution 
conditional on the sample S„ as c* ,.(1 — a) — inf{x e R : P(CM* < x \ S„) > 1 — a}. 
Procedure 4.6 then has the following asymptotic properties: 

Theorem 4.7. Suppose Assumptions A and B hold for some r e (0, 1). Let a e (0, 1), 
hn ^ 00 and h^jn — ?> 0. 

(i) //Ho is satisfied in the sense of (4.3), then P(CM„_^ > c* ^(1 — a)) a, and 

(ii) i/Hi is satisfied, then F{CMn,r > c*_r(l - a)) ^ 1. 

Remark. If Co{^) is known, then Theorems 4.1 and 4.7 remain valid without condition 
(4.3) as long as CMn,T is used in place of CMn,T- 

The next section investigates the finite sample properties of the two Cramer-von Mises 
tests, the quantile periodogram, and the smoothed quantile periodogram in a Monte 
Carlo study. 

5. Simulation Study 

In this section I present a sequence of examples to illustrate quantile spectral methods 
in the context of some familiar time series models, and compare the results to those 
obtained from traditional spectral analysis. 

Example 5.1 (AR(2) with spectral peak). Let {et)t£Z be iid copies of an N(0, 1) variable 
with distribution function Li (2008) investigates the frequency domain properties of 
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Figure 1: Left panel: spectral density (dotted line) of Xt as in (5.1), QS-smoothed periodogram (solid) 
of a realization with n = 300 and i?„ = 13n^^^^ ~ 40.68, chi-squared point- wise 95% confidence bands 
(shaded grey) with A; = 4, and 7„,x(0)/(27r) (dashed). Right panel: median spectrum (dotted) of 
Xt, QS-smoothed median periodogram (solid), chi-squared point-wise 95% confidence bands (shaded 
grey), and 0.5(1 — 0.5)/(27r) (dashed). Both panels use the same data, and k, and are normalized 
by the maxima of their respective spectra. 



a stationary AR(2) process of the form 

Xt = PiXt-i + P2Xt-2 + et, /3i = 2 X 0.95cos(27r x 0.22), = -0.95^ (5.1) 

Shao and Wu's (2007) Theorem 5.2 imphes that Xt is GMC for all a > 0. Since Xt is 
also normally distributed. Proposition 3.1 applies and consequently Assumptions A and 
B hold. To study the finite sample properties of classical and quantile spectral estimates 
for sample sizes n G {300, 600, 900} in this model, I generated 10,000 realizations of the 
process of size 400 + n for each n and then discarded the first 400 observations. Each 
realization was initialized by independent standard normal random variables. The solid 
black fine in the left panel of Figure 1 plots a QS-smoothed periodogram of Xt, i.e., 

/n,x(A) ^ ^Qs(j7^n)7n,x(j)cOs(jA), 

\j\<n 

where %,x{j) := Ylt=\j\+i(^t - ^n)(^t-b| - ^n) and X„ := n-'^J2t=i^t: of one 
such realization with n — 300 and i?„ = 13n^/^ ^ 40.68. The process (5.1) has little 
noise and a single pronounced peak at 27i x 0.22 in its spectral density, shown as the 
dotted line in the left panel of Figure 1. The smoothed periodogram therefore does not 
have much difficulty identifying the peak, although its size is underestimated slightly due 
to the smoothing. The shaded area in the left panel shows 95% asymptotic point-wise 
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confidence bands based on the periodogram of Xt, defined as 

4,x(A) ^^Yl 7n,x(i)cos(jA). 

Iil<n 

The point-wise confidence bands were computed by averaging over 2k + 1 periodogram 
coordinates at natural frequencies in the same way as in Corollary 3.3, but with Qn,T 
replaced by In,x- Here and in all plots below, I used k = 4. The dashed line in the left 
panel plots 7n,x(0) / (27r), i.e., the usual estimate of fx if the spectrum were known to be 
flat. It provides a natural point of comparison for the other quantities; in particular, it 
can be seen from the left panel that the peak at 27r x 0.22 is significantly different from 
a flat spectrum at the 5% level. 

The right panel of Figure 1 analyzes the same data with quantile spectral methods. 
The black fine is the QS-smoothed median (i.e., 0.5-th quantile) periodogram and the 
shaded area graphs 95% point- wise confidence bands computed as described in Corollary 
3.3. Here I used the same values for i?„ and k as in the left panel. The dashed line 
is 0.5(1 — 0.5)/(27r), i.e., the median spectrum under the hypothesis that it is flat. 
The dotted line shows the median spectrum 5(0.5, which can be calculated exactly from 
equation (6) in Li (2008). The smoothed median periodogram clearly identifies the 
peak, although the estimate of the actual size of the peak is shghtly worse than the one 
obtained in the left panel. However, the median spectrum is completely contained inside 
the confidence bands and the peak at 27r x 0.22 differs significantly from a fiat median 
spectrum at the 5% level. 

For both panels the choice of S„ and k matters, with lower values of Bn and higher 
values of k leading to smoother — but not necessarily better — estimates: Figure 2 shows 
the mean integrated square error (MISE) of the QS-smoothed periodogram (left panel) 
and the QS-smoothed median periodogram (right) estimated from the 10,000 realizations 
as a function of Bn/n^^^. Here the behavior of both methods is quite similar and the 
MISEs attain their minimum at Bjn^^^ ^ 13 for each n e {300,600,900}, which 
provides evidence that the optimal growth rate B^ — 0{v}/^) for the QS-smoothed 
periodogram is also a good choice for QS-smoothed quantile periodograms. Further, 
Table 1 shows the empirical frequency of the event that the 95% confidence interval 
at A e {tt X 0.22, 2ti x 0.22, 37r x 0.22} covered the spectrum and median spectrum, 
respectively, in the experiments for k e {2, 4, 6} and n as before. The confidence intervals 
constructed from the periodogram and the median periodogram behaved very similar at 
the three frequencies and covered the population value in nearly 95% of all cases unless n 
was small and k was large. For these values both methods had low coverage frequencies. 

Robust estimators (in the sense of Huber and Ronchetti, 2009, p. 5) exhibit stability, 
i.e., small deviations from the model assumptions should have small effects on the perfor- 
mance of the estimator, and high breakdown resistance, i.e., larger deviations should not 
cause catastrophic results. The following two examples illustrate that classical spectral 
estimates are not robust to outliers in the data, whereas quantile spectral estimators 
provide reliable results in such situations. 
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Figure 2: Emprical MISE of the QS-smoothed periodogram (left panel) and the QS-smoothed median 
periodogram (right) for three different sample sizes as a function of S„/n^/^. Both panels were 
separately normalized by the respective joint maximum of the three curves. 



Table 1: Finite-sample coverage frequencies of an asymptotic 95% confidence interval (CI) for the spec- 
trum and median spectrum of the process in Example 5.1 at A € {tt x 0.22, 27r x 0.22, Stt x 0.22} as 
a function of n and k. 



n 


k 


Periodogram 


CI 


Median Periodo^ 


^ram CI 


TT X .22 


271 X .22 


3n X .22 


vr X .22 


271 X .22 


37r X .22 


300 


2 


0.940 


0.931 


0.937 


0.937 


0.982 


0.961 




4 


0.936 


0.676 


0.931 


0.924 


0.907 


0.974 




6 


0.921 


0.249 


0.909 


0.913 


0.178 


0.979 


600 


2 


0.943 


0.951 


0.948 


0.942 


0.982 


0.956 




4 


0.946 


0.915 


0.947 


0.938 


0.980 


0.962 




6 


0.944 


0.774 


0.941 


0.926 


0.930 


0.965 


900 


2 


0.950 


0.951 


0.948 


0.948 


0.974 


0.956 




4 


0.949 


0.941 


0.946 


0.940 


0.980 


0.959 




6 


0.948 


0.904 


0.947 


0.934 


0.971 


0.964 
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Figure 3: Left panel: spectral density (dotted line) of the process in Example 5.2, QS-smoothcd pc- 
riodogram (solid) of a realization with n = 300 and Bn — 13n^/^ « 40.68, chi-squared point-wise 
95% confidence bands (shaded grey) with fc = 4, and 7„^j)c(0)/(27r) (dashed). Right panel: median 
spectrum (dotted), QS-smoothed median periodogram (solid), chi-squared point-wise 95% confidence 
bands (shaded grey), and 0.5(1 — 0.5)/(27r) (dashed). Both panels use the same data, Bn, and k, and 
are normalized by the maxima of their respective spectra. 



Example 5.2 (Stability of quantile spectral estimators). Suppose that each observation 
in a realization of the AR(2) process from Example 5.1 has a probability p of being 

contaminated by an additional additive error component. For this I drew iid Bcrnoulli(p) 
variables Ji,...,J„ and iid central Student t(z/) variables Ti,...,T„ to generate the 
observed samples as S„ = {Xt + JtTt : t = 1, . . . ,n}, where the Xi, . . . , X„ were taken 
from Example 5.1. The spectral density of the corresponding process {Xt + JtTt)tez is 

which, for any given p, can be made as large as desired by choosing z/ > 2 sufficiently 
close to 2 without violating the assumptions of classical spectral theory. Figure 3 
plots /x-i-jt(A) for p = 0.15 and i/ = 2.001 as a dotted line in the left panel; the me- 
dian spectrum (dotted, right) needed no adjustment because it is invariant under such 
contamination. The other quantities are the same as in Figure 1 and the same 300 
observations were used, but 46 of these were contaminated. The smoothed periodogram 
retains the spectral shape and has a significant spike at 27r x 0.22, but grossly underes- 
timates the location of the spectrum. Moreover, the confidence bands no longer contain 
the spectrum at any frequency. In sharp contrast, the smoothed median periodogram is 
barely affected by the contamination and the confidence bands cover the median spec- 
trum at almost all frequencies. The hypothesis that go,5{2n x 0.22) = 0.5(1 — 0.5)/(27r) 
can also be clearly rejected. 
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I repeated the experiment from Table 1 with the contaminated data. The estimated 
coverage probabihties for the confidence intervals constructed from the periodogram and 
the median periodogram are shown in Table 2. As can be seen, the presence of outliers 

Table 2: Finite-sample coverage frequencies of an asymptotic 95% confidence interval (CI) for the spec- 
trum and the median spectrum of the process in Examples 5.2 as a function of n and k. 



n 


k 


Periodogram 


CI 


Median Periodof 


^ram CI 


TT X .22 


27r X .22 


3% X .22 


TT X .22 


27r X .22 


37r X .22 


300 


2 


0.001 


0.170 


0.002 


0.918 


0.976 


0.958 




4 


0.001 


0.002 


0.001 


0.900 


0.858 


0.965 




6 


0.001 


0.001 


0.001 


0.886 


0.089 


0.972 


600 


2 


0.001 


0.296 


0.001 


0.928 


0.976 


0.952 




4 


0.001 


0.016 


0.001 


0.914 


0.966 


0.958 




6 


0.001 


0.002 


0.001 


0.901 


0.890 


0.963 


900 


2 


0.001 


0.341 


0.001 


0.932 


0.974 


0.948 




4 


0.001 


0.035 


0.001 


0.924 


0.973 


0.957 




6 


0.002 


0.002 


0.002 


0.912 


0.953 


0.957 



had little effect on the performance of the quantile spectral estimates. In sharp contrast, 
the coverage probability for the classical spectrum was almost zero in most cases and 
0.341 in the best scenario {k — 2, n — 900). 

Example 5.3 (Breakdown resistance of quantile spectral estimators). Now suppose in- 
stead that each observation from Example 5.1 has a 15 percent chance of being contam- 
inated by one of the iid Cauchy(0, 1) variables Ci, . . . , C„. The observed samples then 
were = {Xt + JtCt : t = 1, . . . ,n} with the Xi, . . . , Xn as before. Since these outhers 
do not have a well defined mean, the spectral density of the corresponding contaminated 
process no longer exists. Spectral analysis by ordinary methods broke down completely 
when 46 of the 300 observations used for Figure 1 were contaminated: The smoothed 
periodogram in Figure 4 no longer has the expected spectral shape and fails to give any 
indication of a periodicity present in the data. A comparison of the confidence bands to 
the estimate of 7x(0)/(27r) now provides overwhelming evidence for the false hypothesis 
that the process is white noise. In sharp contrast, the median spectrum is unaffected 
by the contamination and the smoothed median periodogram significantly identifies the 
periodicity. In addition, the confidence bands remain essentially unchanged from Exam- 
ple 5.2, which is also confirmed by the coverage probability estimates of the confidence 
intervals constructed from median periodograms provided in Table 3. Here the esti- 
mates were nearly identical to the ones presented in Table 2 for the median spectrum. 
Corresponding estimates for the classical spectrum cannot be computed because it is 
unbounded at all frequencies. 

For the next Monte Carlo exercise, I return to the stochastic volatility model from 
Example 2.1 to illustrate that even if the classical spectrum shows no sign of periodicity, 
almost all quantiles of the distribution can be crossed in a periodic manner. 
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Frequency Frequency 



Figure 4: Left panel: QS-snioothed periodogram (solid black) of a realization of the process in Example 
5.3 with n = 300 and Bn — ISn^/'"' w 40.68, chi-squared point-wise 95% confidence bands (shaded 
grey) with k — A, and 7„ j!f (0)/(27r) (dashed). The spectral density does not exist. Right panel: 
median spectrum (dotted), QS-smoothed median periodogram (sohd), chi-squared point-wise 95% 
confidence bands (shaded grey), and 0.5(1 — 0.5)/(27r) (dashed). Both panels use the same data, 
Bn, and k, and are normalized by the maximum of the upper confidence band (left) and median 
spectrum (right). 



Table 3: Finite-sample coverage frequencies of an asymptotic 95% confidence interval for the median 
spectrum of the process in Example 5.3 as a function of n and k. 



Median Periodogram 



n 


k 


TT X .22 


27r X .22 


37r X .22 


300 


2 


0.904 


0.971 


0.957 




4 


0.889 


0.827 


0.960 




6 


0.861 


0.059 


0.966 


600 


2 


0.915 


0.973 


0.948 




4 


0.901 


0.961 


0.952 




6 


0.884 


0.872 


0.955 


900 


2 


0.918 


0.970 


0.951 




4 


0.905 


0.966 


0.952 




6 


0.883 


0.940 


0.951 



22 



Example 5.4 (Stochastic volatility, continued). Take {et)tei, to be iid copies of an 
N(0, ^^^) variable and let Ut = logv{et-i,et-2, ■ ■ ■ ) be the stationary solution of the 
process Ut = /3iMt-i + fi2Ut-2 + ^t-i with /3i,/32 as is (5.1). Then e"* is log-normally dis- 
tributed and Xt — etv{et-i, £4-2, • • • ) = ^t^"* median zero. To show that Xt is GMC, 
apply the Mean Value Theorem and the Cauchy-Schwarz inequality to obtain the bound 
\\Xn — X'^Wa < ll^nlUlle"" ||2a||M„ — ti[j||2a, where u'^ is Un with {so,S-i, . . . ) replaced by 
{eq, e*_i, . . . ) and lies on the line segment joining Un and m^. By monotonicity of the 
exponential function and the Minkowski inequality, we have 

||e""|Uin{i,2a} < ||max{e"",e-"",e<,e-<}L.^^^ 2^j < 4||e""|Uin{i,2a} < 00 

because the four terms inside the maximum have the same log-normal distribution. If 
needed, the Loeve inequality provides a similar bound for the case < 2q; < 1. The 
GMC property then follows since Ut is GMC by Theorem 5.2 of Shao and Wu (2007). 
The distribution function of Xt is given by Fx{x) = E$(a;/(e"*^)), which can be seen to 
have a bounded density with the help of the Lebesgue Dominated Convergence Theorem. 
Therefore, Assumptions A and B again hold. 

The top two panels of Figure 5 graph the same spectral estimates as in Figures 1 
for n — 600 observations of the stochastic volatility model with 9 — 1. The spectrum 
(not shown to prevent clutter) and the median spectrum (identical to the dashed line 
in the top right panel) of the model are fiat, which is also correctly identified at almost 
all frequencies by both point-wise confidence bands. The bottom two panels show the 
smoothed quantile periodograms (black lines) and point-wise confidence bands (shaded 
grey) at r = 0.25 (left) and r = 0.75 (right) computed from the same data. In both 
panels, the estimated quantile spectra show a considerable spike that is significantly 
different from a flat r-th quantile spectrum at frequency 27r x 0.22, thereby providing 
evidence of a dependence structure that is not present in the mean and auto-covariance 
of the process. Since the quantile spectra of the process do not possess a closed-form 
expression for r 7^ 0.5, I instead also plot smoothed quantile periodograms of n = 10^ 
observations at r = 0.25 (left) and r = 0.75 (right) as dotted lines in the bottom panels 
to illustrate how much of the spectral shape is already recovered in a sample with 600 
observations. Indeed, although the estimates from the smaller sample are more volatile, 
the size and shape of the peaks at 27r x 0.22 are nearly identical for the two sample sizes. 

To evaluate how reliably the quantile spectral estimates discover the cycle at frequency 
27r X 0.22, I recorded the relative number of the test decisions in favor of the hypothesis 
Hq: 5'r(27r x 0.22) = r(l — r)/(27r) in 10,000 realizations of the stochastic volatility 
model using a 95% confidence interval with k = A. The results are shown in Figure 6 for 
different sample sizes as a function of r e (0, 1). At r = 0.5, the null hypothesis is true 
and the tests almost attained the 5% level (lower grey line) for the three sample sizes. At 
the other quantiles, the null hypothesis is false, which was also correctly recognized at 
all sample sizes as long as a quantile not too close to r = 0.5 was chosen. In particular, 
near the quartiles the power of the tests was close to one (upper grey line). 

The additional information obtained from quantile spectral analysis can also be seen 
in Figure 7, where I graph the QS-smoothed quantile periodogram as a function of both 
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Figure 5: Top left panel: QS-smoothed periodogram (solid) of a realization of the process in Example 

5.4 with n = 600 and i3„ = 13n^^^ « 46.73, chi-squared point-wise 95% confidence bands (shaded 
grey) with fc = 4, and 7„,x(0)/(27r) (dashed). Other panels: QS-smoothed r-th quantile periodogram 
(solid), chi-squared point-wise 95% confidence bands (shaded grey), and r(l — t)/{2-k) (dashed) for 
r = 0.5 (top right), 0.25 (bottom left), and 0.75 (bottom right). All panels use the same data, i3„, 
and k. The top left panel is normalized by the maximum of the upper confidence band. The other 
panels are normalized by the joint maximum of the three upper confidence bands. The bottom two 
panels also show QS-smoothed r-th quantile periodograms (dotted) with n = 10^ for r = 0.25 (left) 
and 0.75 (right). 
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Figure 6: Empirical size and power of a test for a cycle with frequency 2tt x 0.22 in the stochastic 
volatility model of Example 5.4 as a function of r. Nominal size at r = 0.5 is 0.05 (lower grey line). 

A and r. Here I chose n — 900 and S„ = %v}/^ ^ 31.18 for a smoother appearance of the 
plot. The two humps in the figure make it clear that most of the dependence structure 
is in fact present near the lower and upper quartiles of the process, whereas working 
with the mean or median provides no insight in this case. 

Finally, the following examples illustrate the size and power of the two Cramer-von 
Mises tests introduced in section 4. 

Example 5.5 (QAR(2) and Procedure 4.4). Table 4 shows the empirical rejection fre- 
quency of the null hypothesis of a fiat r-th quantilc spectrum as a function of n G 
{100, 200, 300} and r G {0.1, 0.5, 0.9} in a variety of settings. For each entry, I recorded 
the test decision of Procedure 4.4 in 10,000 realizations by comparing the test statistics 
to 5% critical values obtained from 10^ simulations each. The first column of the "Size" 
portion provides the rejection frequencies when the data were iid x| variables. In this 
case, the null hypothesis is true at all quantiles. The test behaved mildly conservatively 
for r = 0.1 in smaller samples, but was close to the level of the test at other quantiles 
and samples sizes. In samples larger than 300 (not reported), the test was essentially 
exact at all quantiles. I also experimented with other distributions, including normal. 
Student t(2), and standard Cauchy variables, but found that they had little impact on 
the results. 

The first column of the "Power" portion shows the relative number of rejections when 
the data-generating process was the AR(2) from Example 5.1. Here the null hypothesis 
is false at all quantiles, which was also reliably identified at the median at all samples. 
However, at the outer quantiles the spectral peak is smaller and therefore larger samples 
were needed to detect its presence. The results for the contaminated processes from 
Examples 5.2 and 5.3 are not shown because they were virtually identical. 
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Figure 7: QS-smoothed quantile periodogram across all quantiles of a realization of the process in Ex- 
ample 5.4 with n = 900 and B„ = 8n^/^ « 31.18, normalized by the joint maximum of all quantile 
periodograms. 



Table 4: Rejection frequencies of the null hypothesis for the Monte Carlo Cramer-von Mises test (Pro- 
cedure 4.4) at the 5% level. 



n 


r 




Size 






Power 




xi 


Ex. 5.4 


QAR 


Ex. 5.1 


Ex. 5.4 


QAR 


100 


0.1 


0.022 




0.024 


0.093 


0.007 






0.5 


0.053 


0.068 




0.999 




0.999 




0.9 


0.037 






0.169 


0.332 


0.993 


200 


0.1 


0.019 




0.021 


0.405 


0.043 






0.5 


0.052 


0.076 




1.000 




1.000 




0.9 


0.046 






0.504 


0.468 


1.000 


300 


0.1 


0.048 




0.029 


0.795 


0.188 






0.5 


0.052 


0.080 




1.000 




1.000 




0.9 


0.050 






0.875 


0.724 


1.000 
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The second "Size" and "Power" columns give the rejection frequencies for the stochas- 
tic volatility model from Example 5.4. The null hypothesis is true at r = 0.5, but the 
process is not covered by the assumptions underlying the Monte Carlo test because the 
stochastic volatility model is not iid, which resulted in a mild over-rejection at all sam- 
ple sizes. At the other quantiles, the process satisfies Hi and the test has power against 
this alternative by Corollary 4.5(ii). The power of the test increased sharply with the 
sample size for r = 0.9, whereas for r = 0.1 the increase was considerably slower. Some 
intuition for this result can be gathered from Figure 5, where the estimated quantile 
spectrum in the lower quantiles can be seen to have a long stretch on which it is close 
to the hypothetical quantile spectrum implied by the null hypothesis. In contrast, this 
stretch is somewhat shorter in the upper quantiles. Moreover, as shown in Figure 6, the 
setup for the test is quite demanding because the spectral peak near the extremes of 
the distribution is small. Larger samples (not reported) yielded better results, with the 
power being nearly one at all quantiles for n = 600. 

The third columns of the "Size" and "Power" portions show the relative number of 
rejections of the hypothesis of a flat r-th quantile spectrum for realizations of the QAR(2) 
process (see Example 2.2) 

Xt^4 + ^-\et) + 0.8 X l{et > 0.2} Xt-i + 0.6 x l{et > 0.6} Xt-2 (5.2) 

^ V ' ^ V ' ^ V ' 

Met) Met) Met) 

where, as before, {e)tez is a sequence of iid copies of a Uniform(0, 1) variable. By 
Theorem 5.1 of Shao and Wu (2007), this recursion admits a stationary solution of the 
form (3.1) and satisfies the CMC property. Further, the marginal distribution function 
of Xf can be seen to possess a bounded Lebesgue density from the properties of truncated 
normal variables and dominated convergence. If {Xt)tez is positive, the right-hand side 
of (5.2) is guaranteed to be increasing in et conditional on Xt-i,Xt-2 and the model in 
the preceding display is indeed a proper QAR model. Since the process has a very small 
probability of generating a negative observation, I therefore considered only positive 
realizations of (5.2) in order to enforce well-behaved sample paths. 

The QAR process satisfies the null hypothesis of a flat quantile spectrum for r G 
(0, 0.2] and the alternative at the other quantiles. In particular, it behaves like a sta- 
tionary QAR(l) on T e (0.2, 0.6] that exhibits enough mean reversion to regulate the 
explosive behavior of the process on r e (0.6, 1). This dependence structure induces an 
asymmetric spectral shape across quantiles, with spectral peaks of different sizes at fre- 
quency zero in the middle to upper quantiles. The QS-smoothed quantile periodogram 
of a realization with n = 900 plotted in Figure 8 illustrates this shape. As can be seen 
from Table 4, the Monte Carlo test very reliably detected the presence of the alternative 
hypothesis at r = 0.5 and 0.9 even for n — 100. At r = 0.1 the null hypothesis is true 
and, although Procedure 4.4 does not apply because the observations are not iid, the 
test was only mildly conservative. 

Example 5.6 (QAR(2) and Procedure 4.6). 1 repeated the experiments outlined in the 
previous example with the wild bootstrap test described in Procedure 4.6. I experi- 



27 



0.2 



X 



Figure 8: QS-smoothcd quantilc pcriodogram across all quantilcs of a realization of the QAR(2) process 
in Example 5.5 with n = 900 and = Sn^^^ « 31.18, normalized by the joint maximum of all 
quantile periodograms. 

merited with the block size 6„, but found that the results were not overly sensitive to 
this choice as long as the blocks were not large. I therefore settled for block sizes near 
\/n/2 and used 6„ = 5, 8, and 10 for n = 100, 200, and 300, respectively, although other 
choices are clearly possible; see Shao (2011a) for a thorough discussion. 

I used the warp-speed method of Giacomini, Politis, and White (2007) to estimate 
size and power of the bootstrap test; this method considerably sped up the simulations 
because only one bootstrap replication per Monte Carlo replication was needed. The 
results are shown in Table 5. The important difference to the preceding example is 
that the QAR(2) model (5.2) is now fully covered by the assumptions of the test; see 
Theorem 4.7. This is also reflected in the test for a flat quantile spectrum of the QAR 
process at r = 0.1, which was nearly exact for n — 300. The other results in the "Size" 
portion of the table were similar to the ones given in Table 4 for the Monte Carlo test. 
The power of the bootstrap test was also comparable to the other test, but neither of 
the tests dominated the other: For the AR model both test behaved similarly, for the 
stochastic volatility model the bootstrap test showed a more balanced performance, and 
for the QAR model the Monte Carlo test was more powerful at the outer quantiles. 

6. Conclusion 

In this paper I introduced quantile spectral densities that summarize the cyclical behav- 
ior of time series across their whole distribution by analyzing periodicities in quantile 
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Table 5: Rejection frequencies of the null hypothesis for the bootstrap Cramer-von Mises test (Procedure 
4.6) at the 5% level. The block sizes are 6„ = 5, 8, and 10 for n = 100, 200, and 300, respectively. 



n 


T 




Size 






Power 




xi 


Ex. 5.4 


QAR 


Ex. 5.1 


Ex. 5.4 


QAR 


100 


0.1 


0.027 




0.026 


0.113 


0.056 






0.5 


0.055 


0.094 




1.000 




0.999 




0.9 


0.029 






0.170 


0.110 


0.374 


200 


0.1 


0.031 




0.030 


0.430 


0.339 






0.5 


0.058 


0.083 




1.000 




1.000 




0.9 


0.049 






0.486 


0.422 


0.635 


300 


0.1 


0.050 




0.051 


0.754 


0.550 






0.5 


0.056 


0.090 




1.000 




1.000 




0.9 


0.052 






0.780 


0.567 


0.820 



crossings. I discussed robust spectral estimation and inference in situations where the 
dependence structure of a time series is not accurately captured by the auto-covariance 
function, in particular when the time series under consideration is uncorrelated or heavy- 
tailed. 1 established the statistical properties of quantile spectral estimators in a large 
class of nonlinear time series models and discussed inference both at fixed and across all 
frequencies. Monte Carlo experiments showed that quantile spectral estimates are simi- 
lar to regular spectral density estimates in both shape and interpretation when standard 
conditions are satisfied, but can still reliably identify dependence structures when these 
conditions fail to hold. 

Appendix 

Throughout the Appendix, P* and E* respectively denote outer probability and outer 
expectation (see, e.g., van der Vaart, 1998, p. 258). Probability and expectation condi- 
tional on the observed sample S„ is abbreviated by P(-) := P(- | S„) and E{-) := E(- | S„). 

A. Proofs 

Proof of Proposition 3.1. By assumption, we can find a 5' > such that Fx is Lipschitz 
on (^o(r) - 5', Co(r) + 5'). Choose a large enough N en such that 5 := 5' - > 

and pick any ^ G X^((5); then, for all n > N, apply the pointwise bound |1{X„ < 
- 1{X* < ^}| < l{\Xn - CI < \Xn - the Markov inequality and the CMC 

property to see that 

\\i{x, < o - i{x[ < e}f < P(|x„ - ei < |X„ - X'J) 

< p(|x„ -^\< + - 
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for a large enough absolute constant M. This constant can be enlarged slightly to ensure 
that the inequahty also holds for the remaining n < N. Let a :— take 
suprema over Xr{S) to establish the desired result. □ 

Proof of Theorem 3.2. Let Zt^n = {Xt,t\n) and define the maps h^{Zt,n) = < 
^}cos(tA„) and h^{Zt^n) — ^{Xt < ^}sin(iAn). The empirical process evaluated at some 
function h is denoted by 

n 

t=i 

The finite Fourier transform at nonzero natural frequencies is invariant to centering. 
Hence, we can decompose Ylt=i Vt{T)^~^^'^"' into 

n 
t=l 

I will show that the first two terms in the display are small in probability; here 1 focus 
on the first term because all of the results below can be obtained for h* by replacing 
cosines with sines. Because the decomposition above and all of the following arguments 
also continue to hold for the frequencies A„ + 2Tr j/n with \j\ < k, 1 consider only the 
frequency A„. The last term of the preceding display can then be taken care of with 
results for regular periodograms; see below. 

Define a norm on Xr{S) by p{h^) := supj^gj^ Fix some e > and take 

a grid of points Co(t) - 5 = < 6 < • • • < = ^o{r) + 5 so that 6fc(^t,n) : = 
{h^^{Zt,n) - hk-ii^t,n,))/ cos{tXn) Satisfies p{bk) < e for all A; = 1,...,N. Given a 
^ e Xr{S), we can then find an index k such that \h^ — h^^_^\ < bk- In addition, for a 
ball ~ ^) C + ^) about ^ with radius r, we have 

SUpE* sup MZt,n)-h'iZt,n)\^ <E* SUp | l{Xo < ^1 - 1{^0 < Ol' 

<Fx{^ + r)-Fx{^-r), 

which is 0(r) as r — > due to Lipschitz continuity. The parametric class "K :— {h^ : 

^ G Xr{S)} therefore has bracketing numbers (sec Andrews and Pollard, 1994; van der 
Vaart, 1998, pp. 270-271) with respect to p of order iV(e, JC) = 0{e-^) as e ^ 0. 

By the same calculations as in the preceding display, there is some M > such that 
all e Xr{6) satisfy p{h^ - < M{\^ - ^'\'/'). Hence p{h^^^^^ - h^.^r)) in 
view of Lemma A.l below. It follows that v^iji^^^^^ — /ig(,(r)) -^p if for each e, e' > 
there is an > such that 

limsupP(|i/„(/i|^(^) - h^,^r)) I > e,p(^|„(r) - K{r)) < v) 
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< limsupP* ( sup - I > e J < e' (A.2) 

n->oo VeeX^(5):/9(ft5-/ijQ(^))<»7 / 

The first inequality is immediate. For the proof of the second inequahty I mimic the 
proofs of Andrews and Pollard's (1994) Theorem 2.2 and Lemma 3.1; their arguments 
do not apply directly since Andrews and Pollard work with strongly mixing arrays. The 
results are given in Lemmas A.2 and A. 3 below. Use the Markov inequality and Lemma 
A. 3 in (A.2) to see that i^n{h^o{T) ~ ^^„(r)) ^- Replace cosines with sines in the proofs 
of Lemmas A.2 and A. 3 (with the same bracketing bounds bk as above) to reach the 
same conclusion for i^„(/i| / — h*^ J. Hence (A.l) and continuity imply 
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More generally, this holds at the frequencies + ^nj /n with \j\ < k, and therefore 
it remains to show that the first term on the right-hand side of the preceding display 
converges jointly across A„ + 27ij/n, \j\ < k, in distribution to exponential variables 
with mean 5'r(A). To apply Corollary 2.1 of Shao and Wu (2007), recall that Vt(r) is 
a bounded mean-zero variable and hence the only condition that has to be checked is 
St^o I -^o) — IE(Vt(r) I 3^-i)|| < oo. By the conditional Jensen inequality, the 

law of iterated expectations, and Assumption C, this summability condition is satisfied 
because 

||E(V;(r) I %) - E{Vt{T) I 3^_i) II = ||E(l{Xi < Ur)} - < ^r)} I % 

<\\l{X,<io{T)}-l{X; <Ur 



The joint convergence asserted in Theorem 3.2 follows. □ 

Lemma A.l. Suppose Assumptions B and C hold; then \/n{Cn{T) — Co(t)) = Op{l). 

Proof of Lemma A.l. Arguing as in the proof of Theorem 1 of Wu (2005) , use the Jensen 
inequality and the law of iterated expectations to deduce that 

\\E{i{Xt < o I %)-m{x: < o 1 9^-1,4)11 

= \\E{l{Xt<^}~l{X: <0\%,e*)\\ 

< WHx, < c} - i{x; < 

Taking suprema over Xr{S) shows that Assumption C implies condition (7) of Wu (2007) 
and his Theorem 1 then yields the desired result. □ 

Lemma A.2. Let (l){h^ — h^') :— p{h^ — /i^/)^/*^^"'"'^) for some 7 > and suppose that 
Assumption A holds. For all n e N, all e Xr{Si), and every even integer Q > 2 we 
have 

E| Unih^ - %)|^ < n-^/2^((0(/ij - h^^fn) + ■■■ + {4){h^ - h^>fnf^), 
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where C depends only on Q, 7, and a. The inequality remains valid when — h^i is 
replaced by bk for any given k > 1. 

Proof of Lemma A. 2. It suffices to show the inequahty given in the Lemma after dividing 
both sides by 49 to ensure that the absolute value of 

is bounded by 1. The on the right hand can be absorbed into C . Define H[ in the 
same way as Ht but replace Xt with X[. Here 1 suppress the dependence of Ht and H[ on 
n, ^, and ^' because they are irrelevant in the following. Also note that Eifj = Eifj = 
for alH, n e N and all ^, ^' e X^(5) because Xt and X[ are identically distributed. 

For fixed A; > 2, d > 1, and 1 < m < k, consider integers ti < ■ ■ ■ < < t„i+i < 
• ■ ■ < tfc so that tm+i — tm = d and define ak{\n) = 12"^^ HiLi Since Ut : = 

Ht/ cos(tA„) and t// := H^/ cos(tA„) are stationary, repeatedly add and subtract to see 
that 

lEHt^Ht^ ■■■Ht^- MHt^Ht^ ■ ■ ■ Ht^MHt^^^ • ' • Ht^] 

= afe(A„) \EUt^^tmUt2-tm • • • Utk-tm " '^Uf^-t^Ut^-t^ ' ' ' UoEUd ■ ■ ■ Uti^-t^ I 
< ak{Xn) \m,-t^ ■ ■ ■ Uo{Ua - • • • Ut.-tr. I 

k—m—l 

+ J2 <^k{Xn)\m,-t^---UoU',---{Ut^^^.t^-Ui^^_tJ---Ut,-tJ (A.3) 

i=2 

+ a,(A„) \EUt,_tr^ ■■■UoK--- - EUt,_t^ ■ ■ ■ U^EU, ■ ■ ■ Ut.^t^ \ 

In particular, the last term on the right-hand side is zero because Ut^^t^ ' ' 'Uq and 
U'^ - ■ ■ U't^_t^ are independent and U'^ - ■ ■ U[^_t^ and Ud - ■ ■ Ut^-t^ are identically dis- 
tributed. 

By Assumption A, ||C/, - C/^||, < \\l{Xd < ^ - < Oh + llU^d < O - 1 W < 
^'}\\s < 2sup^gx,(5) < - H^'d < oils < C'a'^ for some C > 0. Here the choice 

of s does not matter because Assumption A still applies when || ■ || is replaced by || ■ 
for any s > 0; see Lemma 2 of Wu and Min (2005). Holder's inequality then bounds the 
first term on the right-hand side of the preceding display by 

ll-f^ti • ■ ■ Htm\\p\\Ht^^2 ■ ■■HtJlqC'a'^, (A.4) 

where the reciprocals of p, q, and s sum to 1. Proceeding as in Andrews and Pollard 
(1994), another application of the Holder inequality yields 




1/ (mp) 



whenever mp > 2 and similarly \\Ht^_^_2 • ■ ■ HtJ\q < (f){h^ — h^')'^'^'^'^^'^ whenever {k — 
m — l)q > 2. Suppose for now that A; > 3. li k > m + 1, take s — (7 + (5)/7 and 
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mp = {k — m — l)q = (A; — 1)/(1 — 1 /s). Decrease the resulting exponent of 4>{h^ — h^r) 
from Q{2 + 7)/(Q + 7) to 2 to see that (A.4) is bounded by C'a'^^{h^ - h^^f. li k > 2 
and k = m + 1, the factor \\Ht,^^^ ■ ■ ■ Ht,, \\q is not present in (A.4), but we can still choose 
s — (7 + (5)/7 and mp = {k — — 1/s) to obtain the same bound. Since the same 
argument also applies to each of the other summands in (A. 3), we can find a constant 
M > such that 

\EHt,Ht, ■■■Ht,\< \EHt,Ht, ■ ■ ■ Ht^EHt^^^ ■ ■ ■ Ht,\ + Ma^ih - h'f- 

Here M in fact depends on /c, but this does not disturb any of the subsequent steps. 

Now replace (A. 2) in Andrews and Pollard (1994) by the inequality in the preceding 
display. In particular, replace their Sa{dY^^ with Ma'^ and their with (j){h^ — /i^')^- 
The rest of their arguments now go through without changes. 

The inequahty for hk follows by letting A„ = 0; this is not a contradiction to the 
assumptions of Theorem 3.2 because this proof is valid for any sequence (A„)ngN. □ 

Lemma A. 3. Suppose the assumptions of Theorem 3.2 hold. For every e > 0, there is 
an r) > such that 

limsupE* I sup I Vfiih^ ~ h^')\ I < ^■ 

n^oo \^,i'&Xr{Sy.p{h^-h^,)<r] J 

Proof of Lemma A. 3. Pick an even integer Q > A. In view of the Jensen inequality and 
continuity, it then suffices to prove the Theorem with 2 in the exponent of the preceding 
display replaced by Q. I do this by following Andrews and Pollard's (1994) proof of 
their Theorem 2.1. (In fact, the arguments work for any fixed exponent since Q can be 
chosen as large as desired.) 

Andrews and Pollard's proof requires three steps: (i) Their "Proof of inequality (3.2)," 
(ii) their "Proof of inequality (3.3)," and (iii) their "Comparison of pairs" argument. 
Replace their i with k and their T{hi) with 4>{bk); then apply Lemma A. 2 above instead 
of Andrews and Pollard's (1994) Lemma 3.1 in the derivation of their inequality (3.5) 
to deduce 

max I Un bk 

l<k<N 

and use this inequality in (i) instead of their inequality (3.5). In (i) Andrews and 
Pollard also require the finiteness of the bracketing integral x~'^/'^'^'^'^^N{x,'Ky^^ dx, 
which follows immediately by choosing 7 = (Q — 2)/2. Another application of Lemma 
A. 2 establishes the required analogue of Andrews and Pollard's inequality (3.5) used in 
(ii). The same inequality can also be applied in (iii). The other arguments remain valid 
without changes. □ 

Proof of Theorem 3.6. Denote by f„,^(j) = ^t{r)Vt-\j\{r) and 

\j\<n 



max< n ' , max Oi, 

I l<fc<Af 
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the infeasible sample auto-covariance and smoothed quantile spectrum, respectively, 
based on the unknown quantile ^oIt). The triangle inequality and |^(-, •)! < 2 yield 

27r sup |^n,T(A) - ^„,r(A)| 
Ae(— 7r,7r] 



1 " 

\j\<n t=\j\+l 

" 
<-Y.\w{j/B^)\ {\Vt{T)-V,{T)\ + \V,.y\{r)-V,.y\{T)\) 

\j\<n t=\j\+l 

2 " / 

l^(.?'/^n)| [^{\Xt-Ur)\<\Ur)-Ur)\} 



+ l{|X,_y|-«o(r)|<||„(r)-^o(r)|}) " 



Consider the first indicator function on the right-hand side of the preceding display 
and recall that \/n(^„(r) — Co(''")) is uniformly tight by Lemma A.l. For a given e > 
and 77 > 0, the Markov inequality implies for large enough M > 

hmSUpPf ^ Y Hj/Bn)\ Y h\Xt-io{r)\<\Ur)-Hr)\} ^ 
™ \j\<n t=b1+l 

< limSUpPf ^ Y \^U/Bn)\ Y l{|^t-$o(r)|<Mn-i/2} > V 

+ supP(|a(r)-Co(r)| >Mn-i/2) 

neN 

< hmsup - Y Hj/B,,)\F{\Xo - Ur)\ < Mn'^''') + e. 

n— >oo V , ., 

\3\<n 

By Lemma 1 of Jansson (2002), the limit superior of B~^Yli\j\<n\''^i3/-B'n)\ is finite, 
and in view of the assumed Lipschitz continuity, the first term on the right-hand side 

of the preceding display then vanishes because S„n-i/2 ^0. The same argument 
applies to the second indicator function above due to stationarity. We therefore have 

SUPa |^n,r(A) - gnA^) \ 0- 

To show SUPa |^n,r(A) — Qri^)] -^p 0, I use Liu and Wu's (2010) Theorem 1, which 
applies whenever the windows w e W satisfy their Condition 1. The only two conditions 
that need to be established are the absolute integrability of w, which is immediate from 

|'"^(^)| dx <2 w{x) dx < oo, and 



limsupE^^ ^w(j75„)^ < oo. 
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Although Liu and Wu (2010) provide a specific value for the limit in the preceding 
display, its boundcdncss is in fact all that is needed for the proof of their Theorem 1. 
To this end, take M > sup^^^ such that for j > 1 



rj/Bn 

w{3/B^f < M\w{j/B„)\ < Mw{3/B^) < MB„ / w{x) dx 

J(i-1)/B„ 



n/Bn 

'{j-l)/Bn 

by monotonicity, and therefore symmetry implies 

°° fj/Bn fOO 

Yl ^OV^n)' <B-^ + 2MJ2 / ^i^) = + 2M w{x) dx, 

jei, 3=1 •^0-i)/-Bn ^0 

which is finite by assumption. This is also true for its limit superior as n ^ oo. The 
triangle inequality completes the proof. □ 

Proof of Theorem 4-i- (i) The process 5'„,t-(A) can be decomposed into 

n— 1 n— 1 

The second term side converges weakly in ^2(11) to a Gaussian process by the proof of 
Theorem 2.1 of Shao (2011a). The Continuous Mapping Theorem then yields CMn,T ^ 
as long as the L2(n)-norm of the first term of the display is eventually small in 
probability. 

To this end, define (pj — ||'0j||n and note that ('0j,'0fe) — for j ^ k. Use this 
orthogonality to write 

n— 1 2 n—1 

Let Tn^i,^) = ij ~ Fx{i)Y{n — j)/n for j > and ^ G M. Under the null hypothesis, 
we have r^(j) = = Tn,T(j, Co(t)) for all j > and, by Lipschitz continuity, there exists 
M > such that fn,r{j) ■■= rnAj^in{r)) < \Fx{in{r)) - < M\in{r) - Ut)\^ for 
all J > 0. In view of these properties, fix some K < n and bound the quantity in the 
preceding display by 

K-l 2 

2^ Y. (^".-(•^') " ^"'-(•^') - (^-'-(•^') - ^-(•^'))) (^•^) 

K-l n-1 

+ 2n Y rnAjfVj + nYi'^nAj) ' ^n,r(j)) V?- (A.6) 

i=i j=K 

Fix e,e' > and let e^j{Xt,Xt+j) := Vt{T,^)Vt+j{r,^). For given j > 0, take p{e^j - 
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e^/j) as the distance of ^ and ^' on Xr{S), where p is as in the proof of Theorem 3.2. 
The distance also depends on r, but this is irrelevant in the following. Note that p{e^j — 
e^'j) < 2||l{Xo < C} ~ l{-^o < uniformly in j > by stationarity. In particular, 
there is some C > such that for any rjj > 0, 1 < j < K, we have 



K-l 

^ U iPi'^Ur), - ^€o(r),,) > VA < P(|4(r) - Ur)\'^' > = o{l). 



3=1 



Given the assumption of the Lemma, it is possible to write \/{n — j)/n Vn-j ^(o(t) = 
VnifnAj) - ^rii)) and y^{n- j)/n ^n-je^^^^-^ = ^/n{fn,r{j) - fnAj))^ where I use the 
notation from the proof of Theorem 3.2. In view of the preceding display, (A. 5) then 
satisfies 

limsupPf n (rn.rO') - - (^n,r(j) - rr{j))) ^1 > e/2 ) 

< limsupP* V ( sup |i^n-j(e^j - e^^Mj) | ) > e/2 

n-^oo \eeX.(5):p(e^j-ej(,(,),,)<r,,- / / 

< - ^ lim sup E* sup I z/„_j- (e^ J - e^o(^) j) | (^J 



2e'^' o 2e' 



e — ' e 



where the first inequality is the Markov inequality, the second inequality follows from 
Lemma A. 4 below and the fourth inequahty uses </?| = 1/ (27rj^) for j > 0. 
Now consider (A. 6). The first term can be bounded by 

2nM'\UT)-Ur)\'Y.'fi' = = ^p(I)- 

By arguments as in the proof of Theorem 3.6, for a large enough M' the probability that 
the second term of (A. 6) exceeds e is less than 

V j=K ^ 

(n— 1 X n 
^ ^^[Yl ^{\Xt-Ur)\<\UT)-^o{T) 
j=K H=j+1 

l) > e/4, \Ur) - Ur)\ < M'n'V^ j + e' 



r)\} 



+ l{|Xt_^-?o(r)|<|i„(r)-&(r)|} 
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j=K H=j+1 

+ n~ 



(n—i / " \ -2 

l{|X*-?o(r)|<M'n-V2}) 
■i-t^H=j+l ^ 

n-\ , n s 2 \ 

' 5] I I] l{|x,_,-eo(r)|<M'n-V2} ) • > e/8 + e' 

16 / " \ ^ 

< — E f n-^/' J] l{|X*-eo(r)|<M'„-i/2} ) J] '^J + ^ 



e 

3>K 

which can be made smaller than 2e' by choosing K large enough. This does not affect 
any of the other bounds. Since e' was arbitrary, we have CMn,T — CMn,T + and 
CMn,T ll'S'rIln! "which proves the first result, 
(ii) Fix some K < n and decompose the statistic into 

K-l K-l 
CMn{T)/n = - fn,r{j))\] + 2 5^(r„,r(j) - rn,r(j))^n,r(j>? 

K-l n-1 
+ Yl ^nAjf^] + E ^n,r(j) V?- 

i=i j=Jsr 

The first and second terms on the right-hand side of the displayed equation converge 
to zero in probability as n — )■ oo because Vn^rij) — ^n,T{j) = Op{l) for each j under the 
assumptions of the Theorem. The third term converges in probability to '^f=i^ '^riiY^'] 
as n ^ oo. The absolute value of the last term is bounded by 16^j>^<^|, where 1 
have used the fact that |f„,T-| < 4. Hence, let if — >■ oo to conclude CMn{T)/n — )-p 
'I2j>o''^r{jYv'j > 0. The desired result now follows from a routine argument. □ 

Lemma A.4. Suppose the assumptions of Theorem 4-i hold. For every j > and every 
e > 0; there is an rj > such that 

limsupE*! sup \^n-j{e^,j — < 

Proof of Lemma A. 4- Fix some e > and take a grid of points CoiT) ~ ^ — Co < < 
■■■<Cn = + 5 so that hjiXt,Xt+j) := 2{l{Xt < Ck}-l{Xt < Ck-i})+2il{Xt+j < 
Ck} — ^{Xt+j < ^fe-i}) satisfies p{bk) < e for all A; = 1, . . . , A^. Given a ,^ G Xt-(5), we can 
then find an index k such that |e^j — e^^_i,j| < bkj. In addition, for a ball — r,^ + r) 
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about ^ with radius r, we have 

E* sup |e5,,(Xo,X,) - ee,,(Xo,X,)|2 

< 16E* sup \l{Xo < ^} - l{Xo < ^'}\^ 

<16{Fx{^ + r)-Fx{^-r)), 

which is 0{r) as r — )■ due to Lipschitz continuity. As above, for each j the parametric 
class 8,j := {e^j : ^ G Xt-((5)} therefore has bracketing numbers with respect to p of order 
A'"(e, = 0{e~^) as e ^ 0. Hence, bracketing integrals of the class Di above and the 
classes £j have the same behavior. The proof of Lemma (A. 3) therefore also applies to 
this Lemma as long as the reference to Lemma A. 2 is replaced by Lemma A. 4 below. □ 

Lemma A. 5. Fix some 7 > and suppose that Assumption A holds. For all n eN, all 
j < n, all e 'Xt-{S), and every even integer Q >2 we have 

E| ly^^jie^j - e^,j)\^ < {n - jy'^/'C {{cf>{e^, - e^,j)\n - j)) 

+ -.- + (</>(e,,,-e,,,r(n-j))Q/2), 

where C depends only on j, Q, 7, and a. The inequality remains valid when e^j — e^/j 
is replaced by bkj for any given k > 1. 

Proof of Lemma A. 5. As in the proof of Lemma A. 2, it suffices to show the inequality 
given in the Lemma after dividing both sides by 8"^ to ensure that the absolute value of 

Et,t+j := {e^A^t,Xt+j)-e^,j{X„X,+j) - (Ee^,,(Xo, X,) - Ee^, ,(Xo, X,)))/8 

is bounded by 1. Define -E't^j+j in the same way as -E't,t+j but replace Xt with X^ and 
Xt+j with X^_^j. For fixed k > 2, d > 1, and 1 < m < k, consider integers ti < ■ ■ ■ < 
tm < tm+1 < • • • < ^ik SO that tm+1 — tm — d. Repeatedly add and subtract to see that 

|lE-Et^,ti+j • • • Et^^tk+j — IE-E'ti,ti+j • • • Et^^tm+j^Et^^^^tm+1+3 ■ ■ • Etk,tk+j\ 

— l^-E'tl-tm-jjtl-tm ■ ■ ■ Et^.-tm-j,tk-tm 

— '^Et^_tm-j,tl-tm ■ ■ ■ E-jflEEl_j^l ■ ■ ■ Et^^tm-3,tk-tm \ 

< \EEt^_t^^j^t^^t^ ■ ■ ■ E_jfl{Ed-j,d - E'd^j^d) 

^ jftm-\-2 tm '^^k tm Jj^fc tm \ 

k—m—1 



+ ^ \EEt^-tm-j,tl-tm • • • E-jfiE'^_j^^ X 



i=2 



^ ^Etm-\-i tm ji^m+i tm '^tm+i~tm~jytm+i~tm^ Et^ tm j-itk tj, 

+ \EEt^_tm-j,tl-tm ■ ■ ■ E-jfiE'a_j d ■ ■ ■ E't^_tm-j,tk-tm 

"^"^^1 tm. j^ti tm. j,0^-^l j^l "^tif; tm. Jj^fc tm. I ' 
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where the last term on the right-hand side can again been seen to be zero. 

Since j is fixed, it is possible to write \\Ed-j,d — E'^-j^d\\s ^ 2||l{Xd < O "~ < 
^}||, + 2\\l{Xd-j < O - HK-j < Oils < C'a%l + a-^) for some C > 0, where the 
cases where d < j were absorbed into C. The same can then be done for (1 + a^^). 
Hence, proceed exactly as above to find a constant M > such that 

< \EEt^^ti+jEt^,t2+j ■ ■ ■ -^tm,tm+j^-^Wl,*m+l+:/ ' ' ' ^tk,tk+j\ 

The rest of the arguments in the proof of Lemma A. 2 now go through without changes. 
The proof for the bounding functions bk is almost identical and therefore omitted. □ 

Proof of Corollary 4-5. (i) Theorem 1 of Lifshits (1982) guarantees that ||>S'^||n has a 
continuous distribution function, and therefore c„^t-(1 — a) — > Coo,t(1 — by Lemma 
21.2 of van der Vaart (1998), where Coo,t is the quantile function of ||5'^||n- Hence, 
CMn,T — c„,r(l — tt) ^ ll'S'TlIn ~ Coo,r(l " Oi) and, in particular, ||5'T-||n — Coo(l — ol) also 
has a continuous distribution function. This in turn implies 

|P(CM„,^ > c„,^(l - a)) - q;| 

= \^CMn,r < Cn,r(l - «)) - P(||-S,||^ < Coo,r(l " «)) | ^ 0. 

(ii) Let CM^^T := ^,>n ^T(j)^<^j and pick an e > such that CMoo,t — e > 0. By 
Theorem 4.1(ii) and the properties of quantile functions, 

^{CMn,r < Cn,r(l - «)) < l(Cn,r(l " «) > n{CM^^r - c)) 

+ P(|CM„,,/n- CMoo.rl > e) 
= 1(1 - a > F{CM'^Jn < CM^,r - e)) + o(l). 

It therefore suffices to show that CM'^ .^/n — >p 0, which follows from an application of 
Birkhoff 's Ergodic Theorem to the first term on the right-hand side of 



K-l , n N 2 

j=l V t=l+j ^ j>K 



and then letting — >■ oo. □ 

Proof of Theorem 4-7. (i) Recall that rT-{j) = for all j > under the null hypothesis 
and let f^^^ij) := n"^ E"=i+i Vt{T)Vt+j{T)ujf Decompose S^^^ into 



n— 1 / n \ 

^"'^'E( E i^tir)Vt+Ar) - V,{T)V,+,{r))u;,U,{X) (A.7) 
j=i \=j+i ^ 



39 



n-1 



j=l H=j+1 



n-1 

nJ2^:,rWj{X). (A. 



As a preliminary step, I show that the L2(n)-norms the first two terms have a P- 
probabihty hmit of zero with high P-probability; the L2(n)-norm of the third term 
converges P-weakly in P-probabihty to ||>S'^||n by Shao's (2011a) Theorem 3.1. I then 
use these results below to prove that the bootstrap test has asymptotic size a. 

The P-expectation of the square of the L2(n)-norm of (A. 7) can be written as 



E 



n— 1 / n \ 
1=1 H=j+1 ^ 



=i+l 

n— 1 L 



n 



= E ^1 E ( E (^t(r)V^,+,(r) - V;(r)V;+,(r)) 

j=\ s=l \eBsn[7+l,n] 

Fix e, e' > and pick a large enough M > such that sup^jgpjP(|Xo— ^o(''')| > Mn^^/^) < 
e'. As in the proof of Theorem 4.1(i), the probability that the term on the right is larger 
than e is at most e' plus 



16 

e 



n— 1 L„ / 



'■{|'^t-«o(T)|<Mn-i/2} 



1 \eSsnb'+i,n 



n-1 



<^(0(n-V2) + 0(Wn))E^?' 

i>o 



which can be made arbitrarily small by letting n — )■ oo and then M — )■ oo. 

Now consider the P-expectation of the square of the L2(n)-norm of (A. 8), which can 
be written as 



E 



n-1 



n 



"/^E^-(^-) E-. W. 



t=j+i 



n— 1 



n 



n-^5^f„,.(j)V|5^Ef J] a;, 

i=i «=i \eSsn[i+i,n] 

n-1 



n— 1 



n-1 



j=l j=l 
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by the Loeve Cr inequality. The first term on the right-hand side of the display converges 
to zero in probability by arguments similar to those given in the proof of Theorem 4.1(i) 
provided that bn/n — )■ 0. The second term is Op{hn/n) by Corollary 2.1 of Shao (2011a). 
It follows that CM*^^ ^ ||>S'r||n probabihty. 

Theorem 1 of Lifshits (1982) and Lemma 21.2 of van der Vaart (1998) then give 

Cn,r(l - Oi) -^p Coo(l " «). ThuS, CMn,T " Cn,r(l - «) -w ||5'T-||n " Coo,r(l " «), which 

yields 

|P(CM,,,><,(1 -«))-«! 

= \^CMn,r < <.(1 - «)) - ^{\\Sr\\l < Coo,r(l - «)) | ^ 0. 



ii) Recall that CMoo,t = ^j>o^T(j) v?^- Pick an e > such that CMoo.t > e and, 



as 



in the proof of Corollary 4.5(ii), the properties of quantile functions and Theorem 4.1(ii) 
imply 

F{CMn,r < c;,(l - a)) < P(c;,(l -a)> n{CM - e)) 

+ P(|CM„,,/n-CMoo,r| >e) 
= P(l - a > P(CM;^/n < CMoo.r - e)) + o(l). 

Hence it suffices to show that ECM* -^p 0, which is seen from 

n-l Ln / \ 2 



n '-^ •' 



almost surely and bn/n ^ 0. □ 
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